Font Recognition Using Shape-Based Quad-tree and Kd-tree Decomposition
نویسندگان
چکیده
The search for appropriate data representations and visual features for content-based image retrieval continues within the computer vision community, alongside the development of new matching and indexing techniques to facilitate fast search in large-scale image databases. In this study, we present a solution to the problem of typeface identification and character recognition in text-based images using this type of approach. Geometrical properties of a character are extracted from its binary image at different levels of spatial resolution, via a hierarchical abstraction of the image data. Two such abstractions are described here: a shape-based quad-tree, and a kd-tree. Unlike the traditional quad-tree representation in which an image is generally partitioned into 4 blocks of equal size at each level of decomposition, the block size in the centroidbased quad-tree is variable, being determined by the location of the centre of gravity of the regions represented in a sub-image. In a similar way, the kd-tree partitions the image data into two, again about an axis defined by the shape of the region represented in the image. Weighted and non-weighted feature vectors of the partitioning points are then used within a metric tree to index character images in a font database. We discuss factors that influence the performance of the resulting font retrieval system, both in terms of accuracy and speed.
منابع مشابه
Font Distribution Observation by Network-Based Analysis
The off-the-shelf Optical Character Recognition (OCR) engines return mediocre performance on the decorative characters which usually appear in natural scenes such as signboards. A reasonable way towards the so-called camera-based OCR is to collect a large-scale font set and analyze the distribution of font samples for realizing some character recognition engine which is tolerant to font shape v...
متن کاملQuad-tree Decomposition Based Image Analysis Using Intensity Difference Threshold
The watermark bit assignment controls the watermark embedding strength and the bit plane selection. Quad-tree decomposition is used as analysis technique which involves subdividing of image into blocks that are more homogeneous in comparison to the image itself. This technique gives the information about the structure of the image. This is the first step in adaptive compression algorithms. For ...
متن کاملA Tutorial on Spatial Data Handling
Spatial data is data related to space. In various application fields like GIS, multimedia information systems, etc., there is a need to store and manage these data. Some datastructures used for the spatial access methods are R tree and its extensions where objects could be approximated by their minimum bounding rectangles and Quad tree based structures where space is subdivided according to cer...
متن کاملA novel stereo image coder based on quad-tree analysis and morphological representation of wavelet coefficients
In this paper, we propose a novel stereoscopic image coder, which consists of a coding unit based on the morphological representation of the wavelet transform coefficients and a disparity compensation unit based on the quad-tree analysis and the disparity compensation between the images of a stereo pair. The coding unit employs a Discrete Wavelet Transform followed by a morphological coder, whi...
متن کاملFractal Image Compression Using Quad Tree Decomposition & DWT
Image compression plays an important role in our day to day activities. Image compression is the process of reducing the amount of data required to represent given quantity of information in image to reduce storage requirements and many other reasons. In a computer image is represented as an array of numbers, integers and is known as digital image. Image array is mainly of two dimensional and t...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2011